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Abstract 

It  is  oRen  asgued  tba,t  the  presence  of  a  unit  root  in  hggregaXe  output  implies  that  there  is  no 
'business  cycle':  the  economy  does  not  return  to  trend  following  b  disturbance.  This  paper  makes 
this  notion  precise,  but  then  develops  a  simple  aggregative  model  where  this  relation  is  contradicted. 
Id  the  model,  output  both  has  a  unit  root,  and  displays  repeated  short-run  fluctuations  around  a 
deterministic  trend.  Some  summary  statistical  evidence  is  presented  that  suggests  the  phenomena 
described  in  the  paper  is  not  without  empirical  basis. 
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Much  macroeconomic  research  has  focused  on  the  evidence  for  and  the  implicatione  of  unit  roots  in  aggre- 
gate output.  Well-known  examples  of  this  work  include  the  contributions  of  John  Campbell  and  N.  Gregory 
Mankiw  (1987),  Peter  Clark  (1987),  John  Cochrane  (1987),  Steven  Durlauf  and  Peter  Phillips  (1986),  Charles 
Nelson  and  Charles  Plosser  (1982),  and  Mark  Watson  (1986).  Many  have  taken  away  from  this  discussion  the 
view  that  business  cycles,  relative  to  secular  changes,  are  small  and  insignificant:  output  does  not  fluctuate 
around  trend. 

This  paper  has  two  main  objectives.  The  first  is  pedagogical;  it  makes  technically  precise  the  relation 
between  the  presence  of  unit  roots  and  the  phenomenon  of  (lack  of)  cycle  around  trend  in  a  way  more 
explicit  than  is  currently  available  in  the  macroeconomic  literature.  Most  studies  almost  instinctually  take 
the  key  characteristic  to  be  the  implied  path  of  an  impulse  response  function.  (See  for  instance  Campbell 
and  Mankiw,  1987.  In  the  empirical  literature,  a  notable  exception  is  Francis  Diebold  and  Glenn  Rudebusch, 
1987.)  The  impulse  response  may  be  a  sensible  statistic  to  examine  for  many  purposes,  but  it  is  ill-suited 
to  represent  the  absence  of  cycle  about  trend.  The  reason  is  that  the  presence  of  fluctuations  about  trend 
is  a  property  of  the  sample  path  behavior  of  a  time  series,  not  of  its  impulse  response  function.  The  latter 
is  at  best  a  conditional  expectation,  and  in  general  not  even  that,  but  only  a  linear  least-squares  predictor. 
Nevertheless,  there  is  a  precise  sense  in  which  the  presence  of  unit  roots  is  inconsistent  with  fluctuations 
about  trend:  average  cycle  length  is  infinite  when  a  linear  time  series  process  has  a  unit  root. 

The  first  objective  therefore  serves  to  sharpen  discussion  on  what  exactly  are  the  implications  of  a 
process  having  a  unit  root.  That  the  points  made  here  are  important  is  illustrated  by  the  second  goal 
of  the  paper.  We  construct  a  simple  model  of  employment  growth  and  leaming-by-doing.  In  the  model, 
the  conditional  expectation  of  output  is  at  least  as  explosive  as  that  for  a  unit  root  process;  however  the 
sample  path  for  output  aJways  displays  well-behaved  finite  length  cycles  about  trend.  Thus  the  finding  that 
estimated  ARMA  representations  for  output  have  a  unit  root  should  not  be  identified  with  a  view  that  there 
are  no  business  cycles  about  trend.  Some  summary  statistical  evidence  for  aggregate  output  in  the  US  is 
also  presented  that  suggests  that  the  distinction  here  is  not  empirically  irrelevant. 

The  plan  of  the  paper  is  as  follows.  Section  I  is  a  technical  section  that  relates  the  presence  of  unit  roots 
to  observable  sample  path  behavior.   This  contains  arguments  that  would  normally  be  omitted,  except  for 
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the  fact  that  impulse  responBe  functions  have  been  so  convincing  to  so  many.  Section  11  presents  a  simple 
economic  model  where  unit  roots  are  consistent  with  well-behaved  finite  length  cycles  about  trend.  Further 
necessary  technical  results  are  also  developed  here.  Section  III  reports  informal  summary  statbtics  that 
suggest  some  empirical  basis  for  the  main  message  of  Section  II.  The  paper  concludes  with  a  brief  Section 
IV.  Technical  proofs  are  presented  in  an  Appendix. 

I.  Unit  Roots  and  Sample  Paths. 

For  the  most  part,  this  section  will  formally  consider  a  pure  random  walk  with  drift.  The  main  conclusions 
carry  over  to  where  the  first  difi'erence  is  a  serially  correlated  stationary  sequence.  Throughout  the  discussion, 
the  reader  should  keep  in  mind  the  distinction  between  difference  stationary  and  trend  stationary  sequences 
emphasized  by  Nelson  and  Ploaser  (1982)  and  Campbell  and  Mankiw  (1987). 
For  t  >1,  let  Y(t)  be  an  observed  sequence  generated  by 

r(i)  =  /s  +  r(f-i)  +  £(f), 

where  £  is  a  zero  mean  covariance  stationary  sequence,  with  spectral  density  bounded  away  from  zero.  The 
sequence  Y  is  therefore  difference-stationary,  in  the  terminology  of  Nelson  and  Plosser  (1982). 

For  any  tiine  trend  coefficient  a,  the  deviations  process 

f 
W{t;  a)  =  Y{t)  -at  =  Y{D]  +  (^  -  a)  •  f  +  [^  e(t) 

rem.aiiiE  a  difference  stationary  sequence: 

W{0-a]  =  Y{0). 
Thus,  removing  any  deterministic  time  trend  from  a  difference  stationary  sequence  only  produces  yet  another 

difference  stationary  sequence.    The  probability  law  for  the  first-differenced  sequence  remains  completely 

unchanged,  except  possibly  in  mean.    The  first  lesson  can  therefore  be  stated  as  follows:   unJess  to  begin 

with,  there  is  a  presumption  tha.t  some  difference  sta.tionasy  sequence  should  display  cycUcaJity,  arbitrary 

linear  detrending  cannot  induce  spurious  cyclica.lity.      (This  may  of  course  fail  if  the  "trend"  coefficient  is 

continually  being  re-chosen  when  a  given  finite  sample  is  growing  through  time.) 


Next,  write 

f 
Y{t)  =  fit  +  ^cU)  +  Y(0). 

Notice  that  the  right  hand  side  contains  terms  of  different  asymptotic  order.  The  deterministic  part  ^  t,  for 
^  different  from  zero,  is  of  mean  square  asymptotic  order  0{n^)  whereas  the  stochastic  component  is  only  of 
mean  square  asymptotic  order  0{n),  where  n  is  the  sample  size.  Thus,  the  more  likely  avenue  for  deviation 
from  cyclical  behavior  is  the  dominant  deterministic  trend  part.  However,  this  component  is  shared  as  well 
by  trend-stationary  sequences  (data  that  is  covariance  stationary  about  a  linear  trend).  Thus  we  will  ignore 
this  part  in  the  subsequent  discussion,  and  concentrate  on  the  purely  stochastic  component.  But  then  this 
is  exactly  a  rero  drift  unit  root  sequence. 

To  fix  ideas  and  to  simplifj'  the  calculations,  suppose  that  e  is  an  iid  sequence,  which  takes  the  values 
1  and  -1  with  equal  probability.  If  anything,  one  would  imagine  that  this  assumption  severely  restricts 
the  possible  cyclical  behavior  of  the  accumulation  J3^(y)'  once  the  sum  gets  sufSciently  distant  from  an 
arbitrary  starting  point,  it  takes  a  special  sequence  of  events  for  the  process  to  return  to  its  origin.  If  e  were 
to  have  infinite  support  (such  as  for  a  normally  distributed  random  variable),  X^£(y)  can  always  return  to 
its  origin  in  one  step.  The  assumption  of  serial  independence  makes  the  calculations  easier;  relaxing  it  would 
change  none  of  the  conclusions. 

Let  W[t;  P)  =  W{t]  =  y^^-i  £(y):  this  may  be  interpreted  as  the  deviation  of  the  economy  from  trend. 
We  study  its  sample  path  properties  through  the  following  set  of  questions.  First,  for  W[t]  =  0,  what  is  the 
behavior  of  the  sequence  of  conditional  expectations  E  [W[t  -f  m]  |  ^''(t)] ,  for  m  >  1?  In  words,  given  that 
the  economy  has  deviated  from  trend,  what  are  expectations  regarding  its  future  path?  Next,  for  W{t)  =  0, 
what  is  the  probability  that  at  time  t -r  m,  the  economj'  will  return  to  trend,  W[t  -i-  m)  =  0?  What  is  the 
average  waiting  time  untD  such  a  return  to  trend?  The  average  cycle  length  would  then  be  twice  this  average 
waiting  time. 

For  the  process  here,  the  conditional  expectation  E\W{i-r  m)  \  W{t)]  is  just  W[t),  for  all  m  >  1.  More 
to  the  point,  when  W[t]  =^  0,  the  conditional  expectation  of  future  W[t+  m)  is  bounded  away  from  zero 
forever.  If  c  is  serially  correlated,  the  exact  pattern  for  the  conditional  expectations  of  future  W  changes 
somewhat.  However,  provided  e  does  not  have  a  spectral  density  that  vanishes  at  frequency  zero,  the  main 
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conclusion  that  the  process  is  never  expected  to  return  to  trend  holds.  Mean  reversion,  or  trend  reversion, 
in  this  expectation  sense,  only  has  to  do  with  whether  W  has  a  unit  root,  not  whether  the  fijst-difference 
sequence  is  serially  correlated.  Campbell  and  Mankiw  (1987)  interpret  this  to  mean  that  there  should  be  no 
presumption  that  an  economy  should  return  to  its  natural  rate. 

What  is  the  likelihood  that  the  economy  does  return  to  trend  at  some  future  time  t-'rTnl  For  the  simple 
example  here,  this  probability  is  not  difficult  to  calculate.  Starting  on  trend  at  W[t)  say,  there  is  a  return 
to  trend  at  time  i  +  m=f+2nif  W[t  +  m)  =  W[t),  The  probability  of  each  possible  sample  path  with 
W[t  +  2n)  =  W[t)  is  (1/2)"  •  (1/2)":  this  happens  when  there  are  n  +l's  and  n  -I's  in  2n  independent  draws 
from  the  distribution  of  e.  However  there  is  more  than  one  such  path  where  W[t-\-  2n)  =  W[t).  The  total 
number  of  different,  and  therefore  mutually  exclusive,  paths  with  W[t  +  2n)  =  W[t)  is  simply  the  binomial 
coefficient  (  ^)  =  (2n!)/(n!)^.  The  probability  of  a  return  to  trend  in  2n  periods  is  therefore: 

P2„  =  (^J")  •  (1/2)=",  for  n  >  1. 

Defining  Pq  =  1,  the  probability  generating  function  for  a  return  to  trend  is: 

p[s)  =  f;  P2„.^ 

n=0 

where  f  is  a  dummy  variable.  (The  reader  who  is  unfamiliar  with  these  kinds  of  calculations  is  referred  to 
William  Feller,  1968,  Ch.  13.)  But  then  by  Newton's  binomial  formula, 

^w=i:Cr)-(i/2)=^"-^"'=(i-^')"'^'- 

This  function  diverges  to  infinity  as  the  variable  s  tends  to  unity.  This  divergence  is  significant  because  we 
can  now  state  the  following:  Even  in  the  presence  of  a  vnit  root,  the  economy  returns  to  trend  infinitely 
often  witi  probabilirj- 1.  In  any  time  period,  no  matter  how  far  the  economy  has  wandered  from  trend,  the 
probability  is  always  positive  of  a  return  to  trend  some  time  in  the  future. 

But  this  is  a  relatively  weak  sense  in  which  shocks  to  a  unit  root  economy  are  not  permanent.  To  see 
why  this  is,  we  turn  to  the  final  question  posed  above.  What  is  the  average  time  between  returns  to  trend? 

Let  the  probability  of  a  first  return  to  trend  in  2n  periods  be  denoted  Q^n-  Then  the  sequence 
{Qzn,  «  >  1}  satisfies: 

n 

P2n  =  53  Q2:P2{n-j),  for  n  >  1. 
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'XL*  right  hand  side  is  Bimply  the  Bum  of  probabilities  of  n  mutualiy  exclusive  events.  For  each  j,  the 
representative  term  Qay  P2(n-})  >*  '■he  probability  of  a  first  return  to  trend  in  j  periods,  followed  by  another 
return  in  2(n  —  j)  periods.  Taken  together,  this  collection  of  disjoint  events  is  simply  the  event  that  there  is 
a  return  to  trend  in  2n  periods,  the  probability  of  which  has  been  computed  above. 

Multiplj'ing  both  sides  by  e^",  summing  over  n  >  1,  and  utOizing  a  fundamental  property  of  convolu- 
tions, the  equation  above  becomes: 

P{a)-l=Q{s)P{s), 

where  Q{s)  has  been  defined  as  l^^j  Q2nS^"-  The  average  waiting  time  for  a  return  to  trend  is  then 

OO 

E  \2n  \W[i  +  2n)  =  W(t)  =  0,  W{t  +  ;)  7^  0,  1  <  ;  <  2n]  =  X]  2n  •  Qzn 

n=l 

=  lim  — — . 

»— 1  ds 


But,  we  also  have: 


which  implies 


(?(.)  =  l-P(.)-^  =  l-(l-.^)^/=, 


^  =  s(l  -  «2)-^/=  -  +00  as    a  -  1. 
ds 

Thus,  tie  averag-e  length  of  a  cycie  in  a  unit  root  economy  is  infinite. 

Therefore,  if  business  cycles  are  defined  to  be  fluctuations  about  trend,  a  unit  root  economy  will  not 
have  business  cycles  of  average  length  equal  to  the  50  months  given  by  conventional  wisdom. 

In  summarj',  this  section  has  made  precise  the  sense  in  which  unit  roots  and  business  cycles  may 
be  %'iewed  as  inconsistent.  As  far  as  I  know,  this  is  the  first  discussion  where  these  difi'erent  notions  of 
persistence  and  business  cycles  have  been  explicitly  related  to  the  presence  of  unit  roots.  (Although  again, 
in  the  empirical  literature,  Diebold  and  Rudebusch,  19S7,  have  taken  steps  in  this  direction.)  Typicallj', 
it  is  taken  for  granted  that  the  conditional  expectation,  or  more  accurately,  the  impulse  response  function 
is  a  sufficient  measure  for  persistence.  That  the  distinction  between  sample  path  behavior  and  that  of  the 
conditional  expectation  sequence  is  important  is  highlighted  in  the  analysis  of  the  next  section. 
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n.  A  Model  with  Unit  Roots  and  (Finite  Length)  BusineeB  Cycles. 

This  section  develops  a  simple  model  where  aggregate  output  is  linearly  represented  by  a  unit  (or  explosive) 
root  process,  but  nevertheless  output  fluctuations  about  a  deterministic  trend  have  finite  average  length. 
The  model  itself  is  not  complicated;  however  its  dynamic  properties  will  need  to  be  studied  by  using  ideas 
that  may  be  unfamiliar.  Thus  we  first  analyze  a  sequence  of  examples  to  build  intuition. 

There  are  two  papers  related  to  the  discussion  of  this  section.  James  Hamilton  (1987)  presents  a  careful 
empirical  investigation  of  stochastic  nonlinearities  in  aggregate  output.  His  model  is  substantively  different 
from  that  in  this  paper,  although  both  share  the  feature  that  there  is  a  discreteness  in  output  growth. 
Technically,  we  will  show  below  that  a  random  sequence  can  be  strictly  stationary  with  fluctuations  of  finite 
average  length,  even  if  it  has  the  conditional  expectations  behavior  of  a  unit  root  process.  Daniel  Nelson 
(1987),  in  a  different  context,  has  also  constructed  strictly  stationary  unit  root  sequences.  The  mechanism 
that  produces  stationarity  in  his  work  are  quite  different  from  that  used  here.  Taken  together  however,  this 
suggests  that  there  may  be  other  examples  of  stochastic  sequences  that  have  a  unit  root  but  nevertheless 
remain  strictly  stationarj'. 

The  examples  here  draw  on  a  generalization  of  a  process  that  first  appeared  (as  far  as  I  know)  in  work 
by  Olivier  Blanchard  (1979).  He  does  not  establish  any  properties  for  the  special  case;  we  do  so  now  for  the 
generalization. 

A.  Some  TechDica.!  Results. 
Let  X{t)  evolve  as 

X{t)  =  n{t)X{t-i)~v[t) 

where  [-i,rj)  is  iid,  with  t  independent  of  tj,  E')  =  T,  Er]  =  0.  Suppose  also  that  |r|  >  1,  while 

Pr(7(l)  =  0)  >  0. 

This  can  be  revkTitten  as  follows: 

X[t)  =  YX{t  -  1)  +  [(7(i)  -  r)  X[t  -  1)  +  r?(t)] 

=>A'(t)  =  rA'(f-i)  +  £(t) 
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Notice  that  while  not  independent,  €{t)  turns  out  to  be  uncorrelated  with  X{t—  l),  and  in  fact,  is  orthogonal 
to  all  lagged  A"8.  This  fact  allows  us  to  conclude  that  the  equation  immediately  above  is  a  regression,  and 
that  r  is  the  population  regression  coefficient.   Thus  X(t)  is  a  stochastic  process  that  has  explosive  roots; 
when  r  =  1,  A'  has  a  unit  root. 

When  Pr(7(l)  =  0)  >  0,  as  we  assume  here,  it  will  be  convenient  to  call  X  a  clinging  process.  The 
reason  for  this  is  that  no  matter  how  small  this  probability  is,  we  have  the  following  property: 

Theorem  1    (Stationarit>').   A  clinging  process  is  strictly  stationary,  and  has  mean  zero  and  inBnite 
variance. 

Proof:  Appendix. 

While  X  has  a  linear  regression  representation  that  appears  explosive  (in  Box-Jenkins  terminology,  it 
does  not  have  a  stationarj'  ARM  A  representation),  it  is  actually  stable:  its  joint  unconditional  distributions 
are  time-invariant  and  are  independent  of  initial  conditions.  Further,  as  is  clear  from  the  arguments  in  the 
Proof,  X  displays  fluctuations  about  its  mean  that  have  finite  average  length. 

Next  consider  any  distributed  lag  function  of  X,  where  the  lag  distribution  is  stable  (Le.,  the  lag 
distribution  has  no  ceroes  inside  the  unit  circle).  The  resulting  random  sequence  is  again  strictly  stationary, 
and  again,  has  an  'explosive'  linear  autoregressive  representation.  But  now  the  transformed  clinging  sequence 
wiU  display  much  richer  dynamics  as  well. 

Without  a  distributed  lag,  A'  displays  much  sharper  returns  to  its  mean,  than  movements  away  from 
the  mean.  Thus  a  sample  path  analysis  may  uncover  evidence  of  asymjnetry.  However,  conditional  on  X[t) 
and  7(i),  X{t  -f  1)  is  sj'mmetrically  distributed  provided  that  rj  is  symmetrically  distributed.  Since  the 
effects  of  initial  conditions  vanish  eventually,  provided  that  the  accumulated  sequence  in  t?  is  dominant,  X 
will  also  unconditionally  have  a  symmetric  distribution.  Next,  a  clinging  sequence  will  display  cycles  (in  the 
sense  of  returns  to  a  neighborhood  of  its  mean)  persistently  and  infinitely  often.  As  mentioned  above,  these 
cycles  turn  out  to  have  finite  average  length.  If  a  time  trend  is  added  to  a  clinging  sequence,  this  example 
would  reconcile  a  number  of  seemingly  contradictor}'  findings:  J.  Bradford  DeLong  and  LavTence  Summers 
(1985)  find  that  after  detrending,  business  cycles  are  actually  'symmetric'  in  unconditional  distribution.  In 
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opposition  to  this,  Salih  Neft(i  (1984)  concludes  that  certain  features  of  economic  fluctuations  do  display  a 
kind  of  asymmetry.  Campbell  and  Mankiw  (1987)  and  Nelson  and  Plosser  (1982)  find  evidence  for  a  unit  root 
in  linear  representations  for  output;  however  it  also  a  wide-spread  belief  that  fluctuations  are  persistently 
recurring  events.  If  the  clinging  model  is  correct,  these  statements  are  not  inconsistent. 

The  conclusion  from  this  discussion  is  that  the  presence  of  unit  roots  alone  is  not  evidence  against 
well-behaved  finite  length  business  cycles.  Further,  in  the  sense  that  the  joint  distributions  are  independent 
of  initial  conditions,  disturbances  do  not  persist  indefinitely  even  in  the  presence  of  unit  roots. 

The  question  that  remains  is  the  following:  Is  there  a  plausible  economic  mechanism  that  will  produce 
a  clinging  sequence  for  output  in  equilibrium?  We  turn  to  this  next. 

B.  A  Simple  Economic  Model. 

The  model  to  be  described  here  relates  employment,  learning-by-doing,  and  output  growth.  Output  is 
produced  with  two  factors  of  production:  capital  K  and  labor  N.  In  any  time  period  t,  the  measured  stock 
of  labor  comprises  skiDed  and  unskilled  employees: 

Nit)  =  No{t]  +  Ni{t). 

Skilled  labor  A^o  is  labor  that  has  been  employed  for  at  least  one  period.    Unskilled  labor  Ni  comprises 
new  entrants.  We  make  the  extreme  assumption  that  only  skilled  labor  is  productive;  unskilled  labor  is  in 
training  and  is  completely  unproductive  in  the  apprenticeship  period. 
Skilled  labor  evolves  as  follows: 

A'o(f)=minlA-(J-l}-A2(;),  Oj, 

where  N^it]  is  the  labor  withdrawn  in  period  t.  This  would  include  factors  such  as  retirements,  a  reduction 
of  market-wide  skills  due  to  the  introduction  of  new  technologj',  or  possibly  sectoral  shifts  between  industries, 
so  that  the  average  skill  level  falls.  This  equation  states  that  skilled  labor  this  period  is  equal  to  skilled  labor 
last  period  minus  any  decumulation  due  to  labor  withdrawals;  at  worst,  skilled  labor  falls  to  zero. 

For  simplicitj',  we  assume  K^  and  Ni  have  the  same  expectation,  so  that  on  average,  the  number  of  new 
entrants  just  equals  the  number  of  withdrawals.  Also  assume  that  both  A'2  and  A^i  are  serially  independent 
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and  to  rule  out  uninteresting  degeneracies,  we  assume  A'j  and  A'^i  have  strictly  positive  variances.    (The 
arguments  below  become  a  little  more  complicated  and  subtle  if  these  don't  hold,  but  the  main  substantive 
conclusions  can  be  made  to  remain  intact.) 
We  then  have  the  following  result: 

Theorem  2  (Eventual  Lobs  of  all  Skilled  Labor).  For  an  &Tbitr&ry  initial  quantity  of  skiUed  labor 
No[0)  >  0,  tJie  probability  is  positive  that  at  some  period  t  >  0,  there  is  a  totai  loss  of  skilled  labor: 
No{t)  =  0. 

Proof:  Appendix. 

Thb  is  not  a  deep  result:  it  says  that  it  will  be  unusual  to  come  upon  situations  where  the  stock  of  labor 
is  always  productive.  One  suspects  that  small  deviations  from  the  assumptions  here  will  leave  the  major 
conclusion  unmodified.  Notice  that  the  result  allows  the  variation  in  withdrawals  ^'2  to  be  arbitrarily  small. 
If  labor  entrants  on  average  exceed  withdrawals,  then  the  result  maj'  not  hold,  as  there  is  then  a  positive 
drift  in  the  stock  of  labor.  However  for  population  growth  rate  not  too  different  from  zero,  the  difference 
must  be  small,  so  that  equality  of  the  two  means  is  a  good  approximation.  Further,  most  would  agree  that 
aggregate  demographic  factors  cannot  be  too  much  related  to  economic  fluctuations.  Thus  a  nongrowing 
labor  force  is  probably  the  natural  assumption  to  use  here. 

The  rest  of  the  model  economy  is  trivial.  Let  the  production  function  be  such  that  (the  logarithm  of) 
output  is  described  by  Y{t)  =  No[t)  K[t)  -r  T7(t),  where  rj  is  a  technology  shock.  Capital  decays  completely 
after  one  period;  however  there  is  no  consumption,  and  all  output  is  invested,  so  that  K[t)  =  I'(i  —  1). 

Thus  the  model  displays  high  persistence  in  output  when  A*o(t)  exceeds  1.  Given  a  technicaUy  skilled 
work  force,  output  evolves  as  a  highly  positively  correlated  sequence.  Productivity  shocks  that  increase 
output  this  period,  Tj[t)  >  0,  tend  to  increase  output  persistently  in  the  near  future.  Increased  output 
implies  a  higher  capital  stock;  with  skilled  labor,  this  increase  in  output  is  transmitted  to  increased  output 
in  future  periods  as  well.  By  Theorem  2  however,  no  matter  how  small  the  fluctuations  in  withdrawals 
from  the  labor  force,  at  some  stage,  the  economy  is  sure  to  lose  all  its  skilled  labor.  When  this  happens, 
the  accumulated  capital  is  "useless",  except  to  train  labor  for  the  next  period.   In  this  case,  the  effects  of 
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productivity  shocks  are  completely  transitory. 

The  model  therefore  quite  easily  produces  a  process  for  output  that  is  (close  to)  a  clinging  sequence. 
The  crucial  property  that  the  random  coefficient  on  lagged  output  periodically  realizes  as  tero  is  borne  by 
our  assumptions  on  the  process  followed  by  skilled  labor.  The  model  has  quite  reasonable  assumptions;  we 
see  that  no  very  special  feature  is  needed  to  produce  persistently  recurring  short-lived  fluctuations  despite 
the  possible  existence  of  a  unit  root  linear  representation.  Slight  variations  in  the  assumptions  on  capital 
decay  can  be  accommodated  by  the  fact  that  distributed  lag  transformations  of  a  clinging  sequence  leave  its 
key  properties  unchanged.  The  fact  that  the  loss  of  all  of  the  skilled  labor  force  is  technically  a  key  factor 
in  producing  the  result  should  not  detract  from  the  plausibility  of  the  effects  here.  (As  an  example  on  a 
smaUer  scale,  consider  a  department's  loss  of  a  senior  econometrician.  Even  if  he  or  she  is  one  among  a  few 
and  the  loss  occurs  only  for  a  short  period  of  time,  it  may  sufficiently  slow  the  flow  of  graduate  students  in 
that  field  that  the  loss  is  felt  for  several  generations  hence.) 

In  Eummarj",  we  deEcribe  again  informally  the  characteristics  of  the  model.  Productivity  shocks  both 
have  a  persistent  long-run  effect  (on  average),  and  also  an  eventuaUy  transitory  effect.  The  persistence 
comes  from  a  feedback  between  capital  and  output.  Increased  output  leads  to  a  high  capital  stock.  With 
a  highly  skilled  labor  force,  this  produces  even  higher  output  in  the  next  period.  The  transitoriness  comes 
from  the  fact  that  the  slight  (exogenous)  variation  in  labor  withdrawals  and  entrants  leads  periodically  to  a 
completely  unskilled  labor  force  that  needs  some  length  of  time  to  become  re-trained.  Given  an  economy-wide 
low-skilled  labor  force,  no  amount  of  beneficial  productivity  shocks  is  immediately  persistent  in  increased 
output.  Under  these  circumstances,  productivity  shocks  have  a  transitorj"  effect.  Of  course,  this  effect  is 
again  only  temporary  for  the  skiDed  labor  force  v.-ill  come  online  again  sometime  in  the  future.  Thiis  in  a 
low  output  period,  the  capital  stock  is  only  apparently  unproductive;  it  serves  to  tool  up  the  labor  force 
for  subsequent  periods.  This  example  has  therefore  embedded  the  djTiamics  of  a  clinging  sequence  in  an 
economic  setting:  we  see  that  it  is  in  fact  quite  plausible  to  model  output  as  a  clinging  sequence. 

The  next  section  is  empirical:  it  attempts  to  obtain  some  evidence  on  whether  the  kinds  of  effects 
described  here  are  present  in  actual  aggregate  measures  of  output. 
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TTT.  Some  Empirical  Evidence. 

The  clinging  sequence  above  produces  a  meaningful  distinction  between  'hard'  and  'soft'  unit  roots.  The 
first  kind  of  unit  root  describes  the  processes  of  Section  I;  the  second  describes  those  of  Section  II.  We  now 
present  some  summary  statistical  evidence  on  distinguishing  the  two. 

Let  Y(t)  be  (the  natural  logarithm  of)  some  measure  of  aggregate  output.  The  null  hypothesis  is  that 
Y  contains  a  hard  unit  root: 

'  Y(t+1)  =  fi-\-Y[t)+A{L)-\(t  +  l),  fort>0, 

where  rj  is  Ud    A'(0,o-^).  Defining  W{t]  l'(0],^)  =  Y{t)  -  Y(0)  -  ^  t,  this  can  be  rewritten: 

W{t+1;  Y{0)J)  =  W(t-  Y{Q),fi)  +  A(Lr'r,{t+ 1) 

Next,  let  X[t)  =  A[L)W{t),  so  that: 

X{t;Y[0),^)  =  X{t-l;Y{0),P)-^v{t). 

The  alternative  hj'pothesis  that  Y  contains  a  soft  unit  root  is  most  concisely  expressed  as  follows.  In 
the  definition  of  W,  substitute  ^o  ui  place  of  y(0),  where  ;So  is  a  free  parameter.  Then  write 

X{t)='j{t)X{t-l)  +  v{t), 

where  7,  r;  are  pairwise  and  serially  independent,  rj  is  A'(0,  r^),  E'l  =  1,  Pr(7  =  0)  =  p  >  0,  and  X  is  denned 
exactly  as  above. 

In  woroE.  a  filtered  version  of  deviations  from  trend  Y{D)  —  ;?t  is  hj-pothesized  under  the  null  to  be 
a  pure  random  walk  with  no  drift.  Under  the  alternative,  the  same  filtered  version  of  trend  deviations  is 
hj-pothesized  to  be  a  clinging  sequence  with  a  soft  unit  root.  The  difference  stationary  model  is  strictly 
nested  within  the  clinging  alternative:  that  null  hypothesis  model  obtains  by  setting  the  variajice  of  7  to 
zero  and  ^60  to  Y{0). 

Estimation  under  the  null  proceeds  as  follows.  First  difference  (the  log  of)  observed  output,  and  estimate 
the  regression 

Y{i)  -  Y{t  -  1)  =  fiA{l)  -  [L-'A{L)]     [Y{t  -  1)  -  Y[t  -  2))  +  v[t). 
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The  model  is  just  identified.    Under  our  assumptions,  ordinary  least  squares  is  equivalent  to  maximum 
likelihood.  Estimation  under  the  alternative  is  a  little  more  complicated.  We  parametrize  7  as  the  product 
of  serially  and  pairwise  independent  Bernoulli  and  normal  random  variables: 

'y(t)=fc(t).c(i), 

where  b  is  Bernoulli  with  probability  p  of  equalling  unity,  e  is  M[p~^,  c^).  Setting  trf  to  eero  and  p  to  unity 
recovers  the  property  implied  by  the  null  hypothesis  that  7  =  1  for  all  t.  The  likelihood  function  for  this 
alternative  model  can  be  written  in  a  way  close  to  that  of  an  unobserved  switching  regression  model  Call  the 
vector  of  parameters  in  the  model  6  ;  the  likelihood  of  the  i—th  observation  on  X[t)  conditional  on  X[t—  1) 
is 

t{X(t)\X{t-l);e)=p-J  ^^l'^-i'f^'-%]+{l-p).4>[X[t)/a,.). 

where  (f>  is  the  density  of  a  standard  normal  The  likelihood  of  the  entire  sample  conditional  on  A''(0)  is  seen 
to  be  simply  the  product  of  these  conditional  likelihoods.  Since  X  is  strictlj'  stationary  under  the  alternative, 
the  impact  of  the  initial  condition  A''(0)  vanishes  asymptotically.  Parameter  estimates  obtained  under  the 
nuU  are  used  as  starting  points  for  the  maximum  likelihood  observation. 

Tables  1  and  2  display  estimates  of  the  model  for  two  measures  of  aggregate  output,  quarterly  GNP  and 
monthly  industrial  production.  Results  are  presented  for  A[L)  chosen  to  be  Erst  through  third  order;  that 
is,  A[L)  =  y^  _o  ajL'  where  ao  is  always  1  and  k  varies  from  1  through  3.  The  GNP  series  is  measured  in 
1982  dollars  and  industrial  production  is  seasonally  adjusted. 

There  are  three  restrictions  under  the  null:  ^0  =  5'(0),  p  =  1,  and  rf  =  0.  Thus,  twice  the  difierence  in 
log-likelihoods  is  asymptotically  distributed  x^  '*'i*'^  three  degrees  of  freedom.  For  GNP,  this  test-statistic 
takes  values  2.614,  3.S42,  and  1.532  with  marginal  significance  levels  of  0.46,  0.28,  and  0.67,  respective!}', 
when  A  is  set  to  first,  second,  and  third  order  lag  polynomials.  Therefore,  the  nuU  hypothesis  that  GNP  is 
difi'erence  stationarj',  rather  than  trend  stationary  cannot  be  rejected.  GNP  has  a  hard  unit  root. 

Turning  to  the  industrial  production  data  in  Table  2,  the  nuD  hypothesis  of  a  hard  unit  root  is  now  seen 
to  be  soundly  rejected.  For  second  and  third  order  polynomials,  the  differences  in  log-likelihoods  is  around 
50,  with  associated  marginal  significance  levels  well  below  the  standard  cutoff  points. 
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Examining  the  resulte  more  closely,  we  see  that  individually  the  probability  p  is  not  significantly  different 
from  1,  and  that  cr^  is  not  significantly  different  from  «ero.  Thus  the  reason  for  rejection  appears  to  come 
mainly  from  the  ^o  intercept  term.  However,  the  estimates  for  these  three  parameters  are  correlated  and  so 
the  individual  f-statistics  may  be  misleading.  Testing  for  the  significance  of  all  three  is  essentially  equivalent 
to  examining  the  likelihood-ratio  statistic,  which  is  what  we  have  done  here.  Evidently,  allowing  for  the 
possibility  that  1  —  p  and  a^  may  differ  from  0  is  sufficient  to  pin  down  the  intercept  term  quite  precisely. 
But  it  is  one  of  the  important  implications  of  the  hard  unit  root  hj^pothesis  that  the  intercept  in  the  "trend" 
line  changes  for  every  different  initial  condition.  More  precisely,  OLS  estimates  for  this  intercept  diverge  as 
the  sample  size  grows  arbitrarily  large  (see  for  example  Durlauf  and  Phillips,  1986).  Our  results  indicate 
that,  contrary  to  this,  for  our  sample  of  industrial  production  data,  the  intercept,  and  consequently  the  trend 
line,  is  in  fact  quite  precisely  estimated. 

A  plot  of  the  likelihood  function  for  industrial  production  (not  presented  here)  shows  a  distinct  and 
unique  peak  in  p  at  0.99.  This  implies  a  realization  of  ■7(i)  =  0  everj'  91  months  on  average.  This  is 
approximately  twice  the  consensus  length  of  a  business  cycle.  However,  notice  that  A{L)  is  a  further  source 
for  cyclical  dynamics.  In  the  second  order  case,  this  polynomial  has  roots  of  0.65  and  0.16.  These  roots  are 
weD  inside  the  unit  circle,  and  serve  to  enrich  the  estimated  cyclicality. 

rV.  Conclufiion 

This  paper  has  undertaken  two  tasks.  The  first  and  pedagogical  objective  in  Section  I  makes  clear  why  unit 
root  processes  and  the  notion  of  persistently  recurring  business  cycles  are  at  odds  with  each  other.  The 
analysis  in  terms  of  the  relation  between  sample  path  behavior  and  the  presence  of  unit  roots  appears  not 
to  have  been  made  explicit  thus  far  in  the  macroeconomic  literature. 

That  it  is  important  to  make  this  connection  becomes  clear  in  Section  11  where  a  simple  model  is  con- 
structed that  displays  persistently  recurring  fluctuations  in  the  sample  paths  of  output  despite  the  presence 
of  a  unit  root.  Summarj"  statistical  evidence  presented  in  Section  III  suggests  that  the  effects  described  in 
the  model  are  not  without  empirical  basis  for  measures  of  aggregate  output  in  the  US. 
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Appendix. 

This  appendix  proves  the  technical  resultB  in  the  paper. 

Proof  of  Theorem  1.  We  verify  that  the  joint  distribution  o/ subsequences  of  X  is  invariant  across  time 
shifts.  Fork  >  1,  Jet  ti,  <2,  •  •  • ,  tfc  ^e  some  collection  of  integers,  assumed  increasing  without  loss  of  generality. 
We  wish  to  show  that  for  all  integer  a: 

F{Xit,),X{t2), ...,  X[t,))  =  F{X{t,  +  s],  X{t,  +  s),...,  X{t,  +  0). 

wJiere  F  denotes  the  joint  d'lstribution  function.  From  the  form  of  the  clinging  sequence,  it  suffices  to 
estabL'sii  equaiity  of  the  unconditional  distributions  of  A''(ti)  and  X(ti  +  s).  Let  m,  M  be  integer  with 
M  >  1.  Iterating  on  X{m)  =  '■i(m)X{m  —  l)  +  r7{m),  we  iiave 

(m  \  m  — 1  I       rn  \ 

n  i{j)\  x{m-M)+vH+  Yi     n  iWjvu). 
j  =  m-rl-M  J  j=m  +  l-M    \k=}-^-l  J 

If  ')[]')  -i-anishes  for  some  j  between  m~  \  —  M  and  m,  X[m)  becomes  independent  of  X[m  —  M).  In  that 
case,  X[m]  can  be  written  entirely  in  terms  of  the  (7,  tj)  partial  sequence  between  epochs  m  +  1  —  M  and 
m.  Setting  m  to  tj  and  fi  +  s  in  turn,  we  see  that  the  distributions  0/ A''(ti)  and  A'(ti  +  s)  are  identical, 
conditional  on  a  zero  realisation  for  7(y),  for  some  j  between  m-rl  —  M  and  m.  More  explicitly,  if  for  some 
J,  0<  j  <  M,  we  have  7(fi  —  ;)  =  '^(tl  -r  s  —  /)  =  0,  tien 

F{X{t,],X{i^],...,Xit,))  =  F[X[t,^s),X[t,-rs),...,X(i^~s)). 

Let  p  denote  Pr(-r[0)  =  0),  witi  p  >  0.  Tie  condizioning  event 

{^(-i  —  y)  =  7(^1  ~  *  ~  y )  =  0,  for  some  y  between  0  and  M  —  1} 

occurs  with  probability  bounded  from  below  by  1  —  (1  —  p^)^.  But  this  tends  to  1  as  M  —•  oc,  hence 
establishing  strict  stationarity.  To  show  that  EX  is  zero,  write  for  r.  >  m  >  0; 

f   "  ^ 

DeEne  ^rn.r.  to  be  this  conditional  expectation:  it  is  a  well-de£ned  random  -vBxiabie  as  it  is  simply  a  finite 
product  of  random  variables.   Since  7  is  iid  and  places  positive  probability  on  0,  for  fixed  m  the  random 
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sequence  {fm.m  i  ^  rn)  converges  almost  Burely  to  the  degenerate  ra.ndom  variable  at  0.    By  the  iaw  of 
itera.ted  expectations,  £(lim„_oo  lm,n)  =  0  uniformly  in  m  which  implies  E{X[t))  =  0.  The  infinite  variance 
property  is  obvious.  QED 

Proof  of  Theorem  2.  Suppose  tha.t  contrary  to  the  conclusion  of  the  theorem,  N{t)  >  0  for  all  t.  But 
then  N  is  a  tero  drift  random  walk.  Thus,  from  any  starting  point,  N  will  take  on  arbitrarily  negative  values 
inEnitely  often  with  probability  1.  This  is  a  contradiction  as  N  is  restricted  to  be  always  nonnegative.  QED 
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FOOTNOTES. 

^  Department  of  Economics,  MIT,  Cambridge  MA  02139.  I  thank  Olivier  Blanchard,  Stanley  Fischer,  James 
Poterba  and  Julio  Rotemberg  for  useful  comments  on  earlier  versions  of  this  paper.  I  also  thank  John 
Taylor,  Mark  Watson  and  an  anonymous  referee  for  suggestions  and  criticisms  that  have  helped  to  focus  the 
discussion.  The  hospitality  of  the  MIT  Statistics  Center  is  gratefuDy  acknowledged. 
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